Path-Phase Information Complementarity for Interfering Particles through 

State-Discrimination 
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We analyze the trade-off between the amounts of information obtainable on complementary prop- 
erties of a qubit state by simultaneous measurements. We consider a "state discrimination" scenario 
wherein the same measurements are repeated, but the input states must be guessed in every run. 
We find a general complementarity relation for path-phase guesses by any generalized measurements 
in this scenario. The counterpart of this input-output mutual information (MI) reveals a hitherto 
unknown aspect of complementarity. 

PACS numbers: 03.67.-a, 42.50.Dv, 42. 50. Ex 



As is well known, the measurement of one observable 
"disturbs" a complementary observable, i.e., introduces 
uncertainty in it. A complementarity or duality relation 
has been derived[l|, Qand experimentally verified[3]for 
Hilbert space of dimensionality 2. This relation quantifies 
path predictability versus fringe visibility of a particle in 
a Mach-Zehnder interferometer (MZI) with a partly effi- 
cient which-path detector (Fig. QJi ).This relation reads: 



D z + V z < 1 



(1) 



The path distinguishability, D, is related to the which- 
way probability, P ww , of guessing the path correctly for a 
known input state and a which- way detector of efficiency 
(reliability) E < 1, while the fringe visibility, V, is re- 
lated to the which-phase probability, Vwp, of guessing 
correctly which MZI port the particle will exit through 
(for an optimal choice of the phase between the arms) 



Vww — — ^ — > ^wp — — - — • 



(2) 



Yet, in this setup, the WW and WP probabilities refer 
to two alternative measurements [5J. Indeed, Vwp is the 
probability of predicting correctly where the particle will 
exit (inset (a) of Fig. QJi ).By contrast, ?ww is opera- 
tionally meaningful only in a measurement (inset (b) of 
the figure). Q]) where the exit beam splitter of the MZI is 
removed, because only then can the readout of the partly 
efficient WW detector be verified. Thus, in the scheme 
of Fig. [TJ simultaneous guesses of path and phase cannot 
be verified or falsified in the same predictive experiment, 
i.e., either V\vp or Vww must represent a counterfactual 
probability. We may think of this "predictive" duality as 
a constraint on the optimal strategies in a single-player 
game, in which the player (Bob) knows the initial state 
and the experimental setup and tries to guess the out- 
come of each measurement. 

Is it possible to obtain a duality relation for path and 
phase information such that both have simultaneous op- 
erational meaning in each experimental run? As we show, 
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Figure 1: Setup for predictive complementarity game. 
The initial state is specified by a phase delay cj> and 
transmissivity T at BS1. Bob can either measure the 
phase (inset a), or the path (inset b). Faraday rotator 
(FR) serves to correlate the path with the polarization. 



such a relation can indeed be given in the context of quan- 
tum state discrimination, namely, measurements aimed 
at optimallyguessing the initial state out of a set of pos- 
sible states[6].In contrast to the "predictive" WW-WP 
duality, the proposed "retrodictive" duality described be- 
low is a bound on the optimal strategies in a two-player 
game: the guessing by Bob which of the several alter- 
native input states had been prepared by Alice prior to 
the one measurement Bob performed. The precise rules 
of the game for this state-discrimination scenario are as 
follows: 

(1) Alice randomly chooses to prepare the particle in 
one of the four input states (Fig. [2aJ) : 

|&ww, frw P )a,0 = T(b ww a)\A) + e bwpi<# T(-6 ww a)|S) (3) 

Here \A,B) are the path states (represented by qubit 



states \<r z = ±1}), T(±b ww a) 



cos 



(f ± 



the cor- 
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responding amplitudes, and e 6wp4 * their relative phase 
factor. The four input states correspond to the choices 
of the parameters (6 WW = ±1, b wp = ±1). 

(2) Bob receives the qubit, and after performing a mea- 



2 




Figure 2: (a) Bloch representation of Alice's four 
alternative WW-WP input states (labeled by 6 WW and 

b-wp), with distances d ww and d wp . (b) Bloch 
representation of 8 WW-WP-WM input states, (c) VN 
scheme (tunable output BS). (d) Multiple inefficient 
detector scheme for mixed state discrimination. 



surement of his choice, tries to guess the values of the two 
bits 6 ww ,6 wp (which are statistically independent), i.e., 
guess which of the four possible input states was chosen 
by Alice. 

For a given choice of Alice's parameters {a,<fi), each 
strategy that Bob adopts yields probabilities Pww and 
Pwp to correctly guess 6 WW and 6 wp , respectively (Roman 
font is henceforth used for Pww and Pwp , as opposed to 
the calligraphic font for "predictive" probabilities V W w 
and P W p above). Bob's strategy is "Pareto optimal" Qif 
there is no other strategy that yields a pair of probabil- 
ities (Pww, Pwp) such that one is strictly better and 
the other not worse than its counterpart. The set of all 
optimal pairs is called the "Pareto Frontier". 

The optimal probability of distinguishing two states 
of a qubit with density operators pi and p 2 is given 
by 1/2 plus their trace distancejl]: cfoaceGoi, pi) = 
iTr|pi — p 2 \. This is half the Euclidean distance be- 
tween the corresponding Bloch vectors. Let us define 

dww = ^Trace(|^ww — ~t~l)^wp); |^ww — l)^wp));<^wp = 

dTracc(|&ww, b wp = +l),|fe W w,^w P = -1))- Here d ww de- 
notes the which-way distinguishability of the set of in- 
puts, while d wp is the which-phase distinguishability. 

The Bloch vector corresponding to the input state 
b" 1 = |&ww,&wp)at,0 m Eq. I© is: 

= rf x + b wp d wp y + b ww d ww z, (4) 

where do — cos a cos 0; dw W = sin a; dw P = cosasin</>. 
The set of Alice's allowed input states form a rectangle 



on the Bloch Sphere with dimensions given by 2<i ww and 
2d wp . (Fig. HiJ). 

Bob is allowed to perform generalized measurements 
(POVMs), as well as projective (Von Neumann) ones, by 
letting the particle interact with an ancillary system (an- 
cilla) and then performing a projective measurement on 
both together (the WW detector in Fig. [T]is just such an 
ancilla). Any POVM can be represented by a set of oper- 
ators, {Ai} i=1 N , satisfying|9j A4 > 0, A] = Ai,J2i A % 
I Operationally, this means that when performing the 
corresponding generalized measurement on a system ini- 
tially in state p, the ith "measurement" outcome appears 
with probability Tr {Aip}. 

Lemma A POVM {Aj} on a qubit can be described 
as a collection of weighted points in the Bloch Ball: Aj = 
fXj 1+ 2 ° ; with pj > 0, ||Rj|| < 1. Furthermore, every 
such POVM has a "refinement" consisting of weighted 
points on the Bloch Sphere (||Rj|| = 1), such that all 
information on the original POVM can be retrieved from 
it. Thus, without loss of generality, we shall assume a 
representation of the latter form. The proof of this lemma 
is given in the Supplement. 

By assumption, all inputs b|" are equally probable: 
Pi = \. The joint input-output distribution is then: 
Pi,j = PiTr {piAj} = i |-(l + do^j +«wp2/jrfwp+«ww^rfww), 
where Rj = (xj , yj ,Zj). 

Theorem .1 For any POVM, the WW and WP proba- 
bilities satisfy: 

/2Pww-iy + /2PwP-iy^ (5) 

Equality holds iff all the Bloch vectors have the form 
Rj = (0, ±yT — Zq, izo), with zq £ [0, 1], and the corre- 
sponding weights p±.± satisfy: p++ = = /i, p^ = 

p — \- = 1 — p for some p € [0, 1]. 

Outline of proof As shown in detail in the Supple- 
ment, the probability of correctly guessing 6 WW is given 
by: 

Pww = ^2 max {Pi„„,j,P-i„ v ,j} = 2 ( x + X! ^2 l z J'l dww 

(6) 

Pwp-^(l + Eyl%l d wp)- (7) 



Combining these two equations, we have 




(8) 

■ 

In what follows we shall restrict ourselves to measure- 
ments (POVMs) which are Pareto optimal, unless stated 
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otherwise. Two particular instances of this optimal class 
are of special interest. The case // G {0, 1} describes a 
Von-Neumann measurement in the x — z plane (this can 
be realized by choosing the phase delays of the output 
beam splitter of the MZI appropriately). The case /Lt = i 
describes the measurement with a WW detector, as in 

Fig. rjji. 

For the WW-detector assisted measurement (Fig. QJ, 
), the probabilities are determined by the detector ef- 
ficiency, E: z = E. Likewise, in the Von-Neumann 
measurement corresponding to an output BS with trans- 
missivity \ < Tout < 1 and correctly chosen input phases 
(Fig. 123, z Q = l-2T 0u t. 

The (unique) measurement minimizing the overall 
probability of error (in guessing both path and phase) 
is also Pareto optimal: 

Theorem .2 A POVM that maximizes the probability of 
guessing the path and phase simultaneously is necessarily 
WW- WP Pareto- optimal. 

Proof: A similar calculation to that used in the proof 
of Theorem fTJ gives the probability of correctly guessing 
the input state, P c — \ (Pww + Pwp — \)- Clearly, in- 
creasing one of the partial probabilities without reducing 
the other implies improving the average ■ 

WW- and WP- Information complementarity. 
These results can be recast in information-theoretic 
terms. Let b ln = (b wv/ ,b wp ) be the two random (sta- 
tistically independent) bits Alice chooses for her input 
state, as before, and denote the observables measured by 
Bob by b out . This set of classical stochastic variables 
(albeit related by a quantum channel) allows the defini- 
tion of the WW(WP)- output information as the mutual 
information between 6 ww ( wp ) and b out : Iww = I(bww : 
b out ); I WP = I(b wp : b out ). 

To separate out the WW correlation explicitly, we in- 
troduce a new stochastic variable constructed out of the 
fundamental ones: 



H(b W w) ~ H(WW). From this last equation (and its WP 
analog) follow the intuitively appealing relations: 

Iww = 1 — ^(Avw), %p = 1 — H2{Pwp)- (10) 

We note that Jww and Jwp are monotonically in- 
creasing functions of P ww and P wp , respectively (be- 



cause P 



u W) Pwp 



G [5,1])- Thus, the Pww-Pwp com- 
plementarity of Eq. J5]) implies a similar trade-off for 
Avw and Jwp- The amount of information about Al- 
ice's input settings contained in Bob's measurement re- 
sults is given by the mutual information between them: 
-fin-out = I(h m : b out ). The relation between im-out and 
Iww, Avp is given by the following: 



Theorem .3 



In — r 



'WW 



'WP 



'WWWP- 



(11) 



Proof: (1) For the important special case /1 = \ (WW- 
detector scheme), the output observables also consist of 
two bits b out = (b^ : b w u p ), which turn out to be statis- 
tically independent. As the notation suggests, the WW 
output bit (obtained from the reading of the WW detec- 
tor) has non-zero mutual information with b ww , but not 
with 6 wp , and conversely for the WP output bit. From 



I(b 
H(b 



.out 
WW j u ww 

JOUt^ 
WW 5 U WW , 



b b out ^ - 



Avp ) "wp I 

+ H(b 



wp 1 "wp J 



H{bw-w 5 ^wp 5 ^ww ? 



and 



H(b ww , O = H(WW) + 1, H(b wp , 6° u p) = H(WP) + 1 
H(b ww , b wp ; Cw, O = H(WW, WP) + 2, (13) 

we obtain: 

Kb b out ■ b b out ) = 

1 V U WW5 U WW ' ^Wpj "wp / 

H{WW) + H(WP) - H(WW, WP) = I(WW : W\P$) 



WW = 



1 P(6 ww ,b° l 
otherwise 



E ) > P(6 WW) b out ) 



(9) 



where P(6 WW , b out ) is the apriori probability for 6 WW , b out 
to take the values which actually occurred. We see from 
Eq. © that Pww = P{WW = 1). This also implies 
H(WW) = P 2 (Pww), where H 2 (x) = H{x, 1 - x) is the 
binary entropy. We define an analogous variable WP, 
such that P W p = P{WP = 1). 

Now, given the value of b out , b ww and WW determine 
each other, so that they are interchangeable, in the joint 
entropy P(& ww ,b out ) = H(WW, b out ). Since for opti- 
mal measurements, WW is stochastically independent of 
b out , as can be seen from Pww = \ (1 + 2o4w) (see Sup- 
plement), it follows that H(b ww , b out ) = H(WW, b out ) = 
H(WW) + H(b out ). By definition, the mutual informa- 
tion 7(& ww : b out ) = H{b ww ) + P(b out ) - H(b ww , b out ) = 



Comparing 



I{b 



H{b 



WW) ^wp ■ "u-w ' "wp 

11 rww 1 "wp , 



H{b ww , b W p] ^ ww 1 



with Eq. (fl2| . and using the independence of b wvl ,b 



w p 



and of 6°^, 6°p* to decompose: P(6 WW , b wp ) — H(b ww ) + 
H(b wp ), P(Cw^wp) = H(blt) + #(0= we S et E( i- 

CD- 

The third term, Iww-.wp, is a novel corollary of our 
treatment. This "cross-information"has its origin in the 
fact that, although 6 WW is independent of b wp and of 
6°p* , and 6°^ is also independent of 6 wp and of b° , 6°^ 
and 6 WW together tell us something about 6° p * and 6 wp . 
Namely, a correct (incorrect) WW guess implies lower 
(higher) probability of correct WP guess. If Bob is al- 
lowed to bet on WW first, and is then told the outcome, 
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this will not change the value of 6 wp he bets on, but will 
affect the odds of his guessing correctly! This is a hitherto 
unnoticed subtle form of complementarity. 

(2) In the general case (// 6 [0, 1]), h out does not de- 
compose neatly into WW and WP parts, and proceed- 
ing as above, one ends up with the relation: iin-out = 
Iww + Avw+ [/ (&ww, b out : 6 WP , b out ) - H (b out )]. The 
expression in the square brackets is known as the condi- 
tional mutual information: I (&ww : frwp|b out ) and is equal 
to Jww:WP) as required. ■ 




(a) (b) 



Figure 3: (a) Illustration of (fTTIl parametrized by the 
detector efficiency, E in a WW-detector scheme (input 

parameters: d ww — .65,d wp = .6). (b) Pareto optimal 
surface with color representing the magnitude of lin-out 

(input parameters: d ww = -65, d wp = .6, d wm = 0.5). 



To conclude, while the "predictive" distinguishability- 
visibility duality holds for two alternative measurements 
(itself a manifestation of complementarity), our state- 
discrimination scenario allows for only one type of mea- 
surement: we have more preparations, instead. This sce- 
nario yields the generalized complementarity and mutual 
information (MI) relations and (fTTj) . and their mixed 
input generalization, lfl~7|) and (fl8l) . These results show 
that complementarity may be reformulated as an infor- 
mation tradeoff obtained on complementary properties 
(path and phase) in a single measurement. The struc- 
ture of this information tradeoff is richer than previously 
thought, as it allows for "cross information": information 
gained on the odds of WW guess given the WP result 
(or vice versa) . Since the goal of this work is to allow for 
simultaneous WW and WP MI, our scenario is not the 
sequence-reversed version of standard predictive duality. 
However, the latter scenario is of great interest to quan- 
tum cryptography As shown in one can derive 
Pareto-optima for the latter problem from those of the 
present one quite simply. Hence, the tight bound on MI 
in (fTTIl . is expected to be useful for improving the 
corresponding one in the cryptographic setting. 

We acknowledge the support of the EC, GIF and ISF. 



Complementarity for mixed states The geome- 
try of the trace-distance suggests the following general- 
ization of the problem to 3 independent input parame- 
ters for mixed states. We now consider 8 input states 
P6 ww= ±i,& wp =±i,b wm =±i corresponding to the Bloch vec- 
tors: 

**6 WW ,6 wp .fe wm — d()X ^wm^wm^- ^wpdwpY "i~ ^ww^ww^j 

(16) 

where d wm is the mixing distance (a measure of im- 
purity), defined similarly to d ww and d wp , and the 
distances satisfy: d^, w + d^, p + a% m < 1 and do = 

^Jl — (d^ w + ^wp) — 4m. The set of all eight states is 
now a rectangular box within the Bloch Ball (Fig. I2b|) . 
The Pareto frontier is now an ellipsoid: 

/ 2P W w-l \ 2 / 2fW P -l \ 2 | / 2P W M-1 \ 2 = 1 

\ dww J \ ^wp / \ dwxn ) 

(17) 

Schemes for path-phase-"mixedness" complementarity 
are shown in Fig. [2c] and [2d] (see supplement). The 
input-output mutual information for optimal schemes is: 

-fin-out = %W + AvP + IwM + CwW:WP:WM- (18) 

Here Cww:WP:WM is the "total correlation" C(WW : 
WP : WM) defined by: C(X,Y, Z) = H{X) + H(Y) + 
H(Z) — H(X, Y, Z). It is the mixed-state counterpart of 
the novel cross-information term Avw:WP in (fTT]l (see 
discussion following (fT5]) ), 
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Supplement 



Lemma 1 above. In this notation we have: 



Lemma A POVM {Aj} on a TLS, can be described 
as a collection of weighted points in the Bloch Ball: Aj — 
fij 1+ ^'' 17 , with fXj > 0, ||Rj|| < 1. Furthermore, every 
such POVM has a "refinement" consisting of weighted 
points on the Bloch Sphere (||Rj| = 1), such that all 
information on the original POVM can be retrieved from 
it. 

Proof: Any hermitian operator A on a qubit's Hilbert 
space is a linear combination of the Pauli matrices and 
the identity: A = ^T + R • <x. If A is further required to 
be positive, then we must have ||R 

from the requirement ^R • a = — 1 

Hence A, = fJ, i 1+ ^ i " T with m > 0, 
definition, in the Bloch ball. The condition that the oper- 
ators sum to the identity implies 5 ^ Mi — 1; Mi r « = 0- 
Also notice that every POVM has a "refinement" consist- 
ing of weighted points on the Bloch Sphere, such that 
all information on the original POVM can be retrieved 
from its refinement. The refinement is obtained by re- 
placing each operator Ai whose Bloch vector lies inside 
the ball ((/x,, R,) , |Rf| < 1), by the pair {A, }, where 



< 77, as can be seen 



IT 



R 

I < 1 



= -1^ > 0. 
which is, by 



.4 



, ±R 7 ; 



Theorem □] For any POVM, the WW and WP prob- 
abilities satisfy: 



/ 2Pvvw — 1 \ 2 ( 2-Pwp — 1 x " 



< 1 (19) 



Equality hol ds iff t he Bloch vectors all have the form 
Rj = (0, ± \J 1 — Zq , ±zo), with the same zq S [0, 1], and 
the corresponding weights fi±,± satisfy: = = 

/j, /i^ = /i |_ = 1 — fj, for some /i £ [0, 1]. 

Proof: Let i = 1 ... 4 stand for each possible 
(iww, v) pair, as described in Eq. ([3]). By assump- 
tion all inputs i are equally probable, and we use the 
notation: Pi — j For the joint input-output distribution: 
Pi,j = piTr {piAj}, we denote the marginal output prob- 
ability by: pj — J^iPij- We define the joint probabil- 
ity between the output state and the first input bit (the 
'WW bit') as, Pi ww ,j = Y^i -P(«ww,iwpb an d similar for 
Pi 2 j . We define conditional probabilities such as p iww u 
and Pi ww | j2 as well. We use 



(20) 



for the Bloch representation of the i-th input state, and 
Rj = (xj, yj, Zj) for that of Aj (the operator correspond- 
ing j-th measurement outcome) , which is possible due to 



Pi.) 



q (1 H~ d{)Xj -\- ^wpl/j^wp ~r~ ^ WW %j ^ww ) 5 (21) 



(22) 



Given that an output j has occurred, the most probable 
value, z ww , of b ww is that for which Pi ww j > P-i„„,j The 
special case when these joint probabilities are equal needs 
to be treated slightly differently, but the results for the 
generic state can be shown to apply to it by continuity, 
and we shall not treat it here. From this follows that the 
total probability for correct WW inference is 



Pww = ^ max {p lww j , p~i vv j } . 



(23) 



ByEq. (|2Tj). 



max{p iwwJ ,p_ iwW!i } = -^(1 4- d Xj + \zj\d ww ), (24) 



therefore 



Pww = \ (l + X^T-l^l^ 



(25) 



2 V ^2 

where we have used \ Mj ' — 1; x j = 0- Similarly, 

Avp-i(i + Eyl%l d wp)- (26) 

Combining these two equations, we have 

2 P\\ \\ - I x \ " . ('21 \\ |.' — 1 \ 2 

(27) 



The last inequality follows from Jensen's inequality (for 
the concave function /(x) = x 2 ). ■ 



Let us now characterize the Pareto- optimal POVMs. 
The inequality becomes an equality iff \zj\ — const. = 
Zq, \yj\ = const. = i/o,Zq + Uq = 1. Clearly, we are free 
to choose yo £ [0,1], and let zq = yl — Vq ■ Then the 
Bloch vectors Rj can take the values (0, ±yo, ±zo) ; with 
corresponding weights i*±,±- The condition Yl^j^-j = 

implies = fx , fx^ = fi |_ while Ylh^o = ^ 

implies tt ++ + = 1. Therefore, the optimal POVMs 

are represented by rectangles residing on a great circle 
in the y — z plane, with sides parallel to the rectangle of 
input states, and weights satisfying: 



Mi m+- = m-+ = 1 -m; [0,1]. (28) 
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Figure 4: Comparison of the exact tradeoff given by Eq. 
(fT9|l (solid line) with the one given by Eq. <[30|) (dashed); 
the dot-dashed line shows -An— out — iww' the gap 
between it and the solid line is the contribution of 
Jww:WP to hn-out- All of these relations generalize 
naturally to the three dimensional case 
(iww + Iwp + Iwm)- 



For the optimal POVMs, Eqs. 



simplify to 



Pww = 7; (1 + z d ww ) , P W p = 7; (1 + yo^wp) • (29) 
Bounds on iww + iwp 



We note that by Holevo's theorem[8| ii n _ ou t < 
S(J2PiPi) -J2PiS(Pi) = H2(^j £L ), where S denotes the 
Von-Neumann entropy, and p, ,Pi denote the ith initial 
state and its a priori probability, respectively. However, 
this bound is not tight in the present situation. Together 
with Eq. ifTTj) . this implies the following bound: 



'ww 



iwp < H 2 { — — -), 



(30) 



where do = yl (°^ww + ^wp)- As mentioned there, 
this bound is not tight. However, as explained following 
Eq. (flUI , the Pww — Avp Pareto-frontier is also the fron- 
tier for iww — iwPj as the is are monotonic functions of 
the Ps. Thus, Eq. (fl"9| implicity gives the exact tradeoff 
between the is. 

WW-WP-WM experiments 

The Von Neumann projective measurement depicted in 
Fig. [2c] can roam on the entire ellipsoid, provided the in- 
put phase on the output BS be tunable, as well as its bias. 
Conversely, any output beam splitter is Pareto-optimal. 
The WW detector generalizes to a two detector scheme, 
with detectors with efficiencies E\,Eh (Fig. [2d| . yield- 

i" g: = Eu ^ 



— \/l — E\E%, 



